CDS

Accession Number TCMCG074C20225
gbkey CDS
Protein Id KAF8401832.1
Location join(12770503..12770732,12770945..12771160,12771294..12771405,12772793..12772855,12772940..12773078,12777865..12777923,12779962..12780035,12780212..12780359,12780612..12780785,12781559..12781646,12781754..12781870,12782314..12782411,12782507..12782606,12782756..12782850,12784794..12784900,12785019..12785112)
Organism Tetracentron sinense
locus_tag HHK36_012778

Protein

Length 637aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000008.1
Definition hypothetical protein HHK36_012778 [Tetracentron sinense]
Locus_tag HHK36_012778

EGGNOG-MAPPER Annotation

COG_category F
Description phosphoribosylaminoimidazole carboxylase
KEGG_TC -
KEGG_Module M00048        [VIEW IN KEGG]
KEGG_Reaction R04209        [VIEW IN KEGG]
KEGG_rclass RC00590        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K11808        [VIEW IN KEGG]
EC 4.1.1.21        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00230        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00230        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCTTCTCCGGTGTACTTCTCCGGCATTTTGCGGTCACCAAAATAATCATATCTTTGCTTCAGATTTCATCTCTCGACCTTCCTTTCTACGGAGAAAGGAATTAGGGTTTTGGATGGAACGTTTCAGCGGGATTTCTTATTCATTGCAGCAACAACAGCAGCAGCAGAAGAAGAAGAACTGGATATCTTGTTGTCAAGATTCCTTTGAAGATCATGACAGTTCAAGCAGGAAGGTTGATTTGCCTGTTCATGGAATATCGGAAACGATTGTGGGTGTCCTGGGAGGAGGGCAATTGGGACGTATGCTATGTCAGGCAGCTTCACAAATGGCCATCAAAGTAATGGTTTTGGATCCACTTAAGAACTGCCCCGCGAGTGCGTTGTCCTATTCCCATATGGTTGGAAGTTTTGATGACAGTGCGGCCGTTCAGGAATTTGCAAAGAGATGTGGAGTATTGACTGTCGAAATTGAACATGTTGACGTTGCTACGTTAGAGAAGCTTGAACAAGAAGGAGTAGATTGCCAGCCTAAAGCCTCTACCATCCAAATAATCCAGGACAAGTATCTCCAAAAATTACATTTTTCTCAGCATGCCATTCCACTTCCTGATTTTATGCAGATAGATGATCTCGAAAGTGCAAAGACAGCAGGTGACGAATTCGGTTATCCTCTTATGATTAAGAGCAAAAAGCTAGCTTATGATGGGCGTGGAAATGCTGTCGCTAACAGCAAAGAGGAGCTTTCTTCTGCTGTATCTGCTCTTGGAGGATTCAGTCGAGGCTTATATGTTGAGAAGTGGGCATCATTTGTTAAGGTGGAGCTGGCTGTCATTGTGGCAAGGGGAAGAGACAATTCTATTTTGTGTTATCCTGTTGTTGAAACTATTCATAGGGAAAACATTTGTCACATAGTAAAGGCACCTGCTGATGTGCCATGGAAGATCAAGAAACTTGCCACTGATGTTGCACAAAAAGCTATTAGTTCTTTAGAAGGTGCTGGTGTCTTTGCGGTTGAGTTGTTTTTGACGAGGGATGGTCAGGTTTTGCTAAATGAAGTAGCTCCCAGACCTCACAATAGTGGGCATCACACAATTGAAGCTTGTTTTACTTCACAATTTGAACAGCATTTGCGTGCAGTTGTCGGTCTTCCACTTGGTGATCCATCGATGAAGACGCCAGCTGCTTTAATGTATAATATACTTGGAGAAGAAGAGGGGGAGCCAGGGTTCTATTTGGCTCAGCAACTGATTGGAAGGGCATTGAGTATTCCTGGGGCCACTGTTCATTGGTATGATAAGCCAGAAATGAGAAAGCAACGAAAGATGGGCCATATCACTATTGTTGGCCCTTCTATGGGCATAGTGGAAGCAAGGCTGAATTTATTGCTGAACAGAGAAAGTTTAGATGGCCAAATTACAGTCCCTCCACGTGTTGGGATTATAATGGGTTCTGATTCAGATCTTCCAGTCATGAGTGATGCTGCAAGGATTTTGAATTCCTTTGGTGTGCCTTATGAGGTGAGAATAGTTTCAGCACACCGGACCCCGGAAATGATGTTTTCTTATGCTTTGTCTGCTCGGGGGCAAGGCATTCAGATTATCATTGCTGGTGCTGGTGGTATGGTAGCTGCATTGACTCCCTTACCTGTTATTGGAGTCCCTGTACGTGCTTCTTCATTGGATGGACTTGACTCCCTCTTATCGATTGTGCAGATGCCAAGAGGTGTCCCTGTTGCAACTGTTGCAATAAACAATGCAACAAATGCAGGTCTGTTGGCAGTAAGGATGCTGGGGGTTGGGGATGCTGATTTACAAGAAAGAACAATCCAATACCAAGAAGACATGAAGAATGATGTCCTGGCAAAAACAGAGAAGCTGGAGACAGATGGTTGGGAAGGTTATTTAAATCCTTGA
Protein:  
MLLRCTSPAFCGHQNNHIFASDFISRPSFLRRKELGFWMERFSGISYSLQQQQQQQKKKNWISCCQDSFEDHDSSSRKVDLPVHGISETIVGVLGGGQLGRMLCQAASQMAIKVMVLDPLKNCPASALSYSHMVGSFDDSAAVQEFAKRCGVLTVEIEHVDVATLEKLEQEGVDCQPKASTIQIIQDKYLQKLHFSQHAIPLPDFMQIDDLESAKTAGDEFGYPLMIKSKKLAYDGRGNAVANSKEELSSAVSALGGFSRGLYVEKWASFVKVELAVIVARGRDNSILCYPVVETIHRENICHIVKAPADVPWKIKKLATDVAQKAISSLEGAGVFAVELFLTRDGQVLLNEVAPRPHNSGHHTIEACFTSQFEQHLRAVVGLPLGDPSMKTPAALMYNILGEEEGEPGFYLAQQLIGRALSIPGATVHWYDKPEMRKQRKMGHITIVGPSMGIVEARLNLLLNRESLDGQITVPPRVGIIMGSDSDLPVMSDAARILNSFGVPYEVRIVSAHRTPEMMFSYALSARGQGIQIIIAGAGGMVAALTPLPVIGVPVRASSLDGLDSLLSIVQMPRGVPVATVAINNATNAGLLAVRMLGVGDADLQERTIQYQEDMKNDVLAKTEKLETDGWEGYLNP